NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

An Investigation on LLMs' Visual Understanding Ability Using SVG for Image-Text Bridging

https://doi.org/10.1109/WACV61041.2025.00525

Cai, Mu; Huang, Zeyi; Li, Yuheng; Ojha, Utkarsh; Wang, Haohan; Lee, Yong Jae (February 2025, IEEE)

Free, publicly-accessible full text available February 26, 2026
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance

https://doi.org/10.1109/ICCV51070.2023.01073

Huang, Zeyi; Zhou, Andy; Lin, Zijian; Cai, Mu; Wang, Haohan; Lee, Yong Jae (October 2023, IEEE)

Full Text Available
The Two Dimensions of Worst-case Training and Their Integrated Effect for Out-of-domain Generalization

https://doi.org/10.1109/cvpr52688.2022.00941

Huang, Zeyi; Wang, Haohan; Huang, Dong; Lee, Yong Jae; Xing, Eric P. (June 2022, Conference on Computer Vision and Pattern Recognition (CVPR))

Training with an emphasis on “hard-to-learn” components of the data has been proven as an effective method to improve the generalization of machine learning models, especially in the settings where robustness (e.g., generalization across distributions) is valued. Existing literature discussing this “hard-to-learn” concept are mainly expanded either along the dimension of the samples or the dimension of the features. In this paper, we aim to introduce a simple view merging these two dimensions, leading to a new, simple yet effective, heuristic to train machine learning models by emphasizing the worst-cases on both the sample and the feature dimensions. We name our method W2D following the concept of “Worst-case along Two Dimensions”. We validate the idea and demonstrate its empirical strength over standard benchmarks.
more » « less
Full Text Available
Virus-assisted directed evolution of enhanced suppressor tRNAs in mammalian cells

https://doi.org/10.1038/s41592-022-01706-w

Jewel, Delilah; Kelemen, Rachel E.; Huang, Rachel L.; Zhu, Zeyu; Sundaresh, Bharathi; Cao, Xiaofu; Malley, Kaitlin; Huang, Zeyi; Pasha, Muhammad; Anthony, Jon; et al (January 2023, Nature Methods)

Full Text Available
Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features

Wang, Haohan; Huang, Zeyi; Zhang, Hanlin; Lee, Yong Jae; Xing, Eric P. (January 2022, Conference on Uncertainty in Artificial Intelligence (UAI))

Machine learning has demonstrated remarkable prediction accuracy over i.i.d data, but the accuracy often drops when tested with data from another distribution. In this paper, we aim to offer another view of this problem in a perspective assuming the reason behind this accuracy drop is the reliance of models on the features that are not aligned well with how a data annotator considers similar across these two datasets. We refer to these features as misaligned features. We extend the conventional generalization error bound to a new one for this setup with the knowledge of how the misaligned features are associated with the label. Our analysis offers a set of techniques for this problem, and these techniques are naturally linked to many previous methods in robust machine learning literature. We also compared the empirical strength of these methods demonstrated the performance when these previous techniques are combined, with implementation available here
more » « less
Full Text Available
Enhanced Directed Evolution in Mammalian Cells Yields a Hyperefficient Pyrrolysyl tRNA for Noncanonical Amino Acid Mutagenesis

https://doi.org/10.1002/anie.202316428

Jewel, Delilah; Kelemen, Rachel_E; Huang, Rachel_L; Zhu, Zeyu; Sundaresh, Bharathi; Malley, Kaitlin; Pham, Quan; Loynd, Conor; Huang, Zeyi; van_Opijnen, Tim; et al (January 2024, Angewandte Chemie International Edition)

Abstract Heterologous tRNAs used for noncanonical amino acid (ncAA) mutagenesis in mammalian cells typically show poor activity. We recently introduced a virus‐assisted directed evolution strategy (VADER) that can enrich improved tRNA mutants from naïve libraries in mammalian cells. However, VADER was limited to processing only a few thousand mutants; the inability to screen a larger sequence space precluded the identification of highly active variants with distal synergistic mutations. Here, we report VADER2.0, which can process significantly larger mutant libraries. It also employs a novel library design, which maintains base‐pairing between distant residues in the stem regions, allowing us to pack a higher density of functional mutants within a fixed sequence space. VADER2.0 enabled simultaneous engineering of the entire acceptor stem ofM. mazeipyrrolysyl tRNA (tRNA^Pyl), leading to a remarkably improved variant, which facilitates more efficient incorporation of a wider range of ncAAs, and enables facile development of viral vectors and stable cell‐lines for ncAA mutagenesis.
more » « less

Search for: All records